Comparison of width-wise and length-wise language model compression

Authors

  • Edward W. D. Whittaker
  • Bhiksha Raj
Abstract

In this paper we investigate the extent to which Katz backoff language models can be compressed through a combination of parameter quantization (width-wise compression) and parameter pruning (length-wise compression) while preserving performance. We compare the compression and performance achieved using entropy-based pruning against that achieved using parameter quantization alone, and then compare combinations of both methods. It is shown that a broadcast news language model can be compressed by up to 83%, to only 12.6Mb, with no loss in performance on a broadcast news task. Compressing the language model further by quantization, to 10.3Mb, resulted in only a 0.4% degradation in word error rate, which is better than can be achieved through entropy-based pruning alone.
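
To make the two compression directions concrete, here is a minimal Python sketch on a toy set of bigram log-probabilities. It is not the authors' method: the abstract does not say which quantizer was used, so a uniform codebook is assumed here, and the pruning step is a plain probability threshold standing in for entropy-based pruning, which ranks n-grams by the change in relative entropy their removal causes. All names and numbers below are hypothetical.

```python
def build_codebook(values, bits):
    """Uniformly spaced codebook with 2**bits levels spanning the value range.

    Assumption: uniform spacing is chosen for simplicity only.
    """
    lo, hi = min(values), max(values)
    levels = 2 ** bits
    if levels == 1 or hi == lo:
        return [lo]
    step = (hi - lo) / (levels - 1)
    return [lo + i * step for i in range(levels)]


def quantize(value, codebook):
    """Return the index of the codebook entry nearest to value."""
    return min(range(len(codebook)), key=lambda i: abs(codebook[i] - value))


def prune(ngrams, min_logprob):
    """Crude stand-in for entropy-based pruning: drop low-probability n-grams."""
    return {ng: lp for ng, lp in ngrams.items() if lp >= min_logprob}


# Hypothetical toy bigram log10-probabilities (illustrative, not from the paper).
bigrams = {
    ("the", "news"): -0.52,
    ("the", "model"): -1.10,
    ("news", "today"): -0.95,
    ("model", "size"): -2.75,
    ("size", "of"): -0.31,
}

# Width-wise compression: each parameter is stored as a 2-bit codebook index
# instead of a full floating-point value.
codebook = build_codebook(list(bigrams.values()), bits=2)
indices = {ng: quantize(lp, codebook) for ng, lp in bigrams.items()}
dequantized = {ng: codebook[i] for ng, i in indices.items()}
max_error = max(abs(dequantized[ng] - lp) for ng, lp in bigrams.items())

# Length-wise compression: fewer parameters are stored at all.
pruned = prune(bigrams, min_logprob=-2.0)
```

Quantization shrinks each stored parameter while keeping every n-gram; pruning keeps full-precision parameters but stores fewer of them. In a real backoff model, pruning would also require recomputing backoff weights, which this sketch omits.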


Similar resources

Life-wise Language Learning Textbooks: Construction and Validation of an Emotional Abilities Scale through Rasch Modeling

Underlying the recently developed notions of applied ELT and life syllabus is the idea that language classes should give precedence to learners’ life qualities, for instance emotional intelligence (EI), over and above their language skills. By so doing, ELT is ascribed an autonomous status and ELT classes can lavish their full potentials to the learners. With that in mind, this study aimed to d...


A Look into the Association between Life-Wise Syllabus and Teacher Success: A Quantitative Study in an EFL Context

The current study aimed at investigating the relationship between life-wise syllabus- as a new type of syllabus which emerged as the result of recent developments in applied ELT and corresponding to the "education for life"- and teacher success in an EFL context. Life-wise syllabus introduced by Pishghadam in 2011 directed English teachers’ attention to give priority to life issues rather than ...


A Succinct N-gram Language Model

Efficient processing of tera-scale text data is an important research topic. This paper proposes lossless compression of N gram language models based on LOUDS, a succinct data structure. LOUDS succinctly represents a trie with M nodes as a 2M + 1 bit string. We compress it further for the N -gram language model structure. We also use ‘variable length coding’ and ‘block-wise compression’ to comp...


Fuzzy Linear Programming Method for Deriving Priorities in the Fuzzy Analytic Hierarchy Process

There are various methods for obtaining the preference vector of pair-wise comparison matrix factors. These methods can be employed when the elements of pair-wise comparison matrix are crisp while they are inefficient for fuzzy elements of pair-wise comparison matrix. In this paper, a method is proposed by which the preference vector of pair-wise comparison matrix elements can be obtained even ...


Providing a Model for Explaining the Galatia Effect on Auditor's Wise Decision Making

Entrance to the specialized audit profession requires to making decision and having distinct attributes and properties, which the capability of taking wise decision is one of them. The task of auditors as a specialized profession is opinion about financial statement that in turn to it requires to auditor’s neutrality in decisions. If auditors be independent from clients, in the best possible wa...



Publication date: 2001